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Abstract 

We propose in this work to derive a CLT in the functional linear regression model. The 
main difficulty is due to the fact that estimation of the functional parameter leads to a kind of 
ill-posed inverse problem. We consider estimators that belong to a large class of regularizing 
methods and we first show that, contrary to the multivariate case, it is not possible to state 
a CLT in the topology of the considered functional space. However, we show that we can 
get a CLT for the weak topology under mild hypotheses and in particular without assuming 
any strong assumptions on the decay of the eigenvalues of the covariance operator. Rates 
of convergence depend on the smoothness of the functional coefficient and on the point in 
which the prediction is made. 

Keywords: Central limit theorem, Hilbertian random variables, functional data analysis, co- 
variance operator, inverse problem, regularization, perturbation theory. 

1 Introduction 

For several years, there has been a considerable interest in Functional Data Analysis. Indeed, a 
consequence of advances in technology is the collection of many data sets on dense grids (image, 
satellite, medicine, ...) adding in some sense more and more information. The question is then: 
can we do something specific with this new information ? It is the merit of the books by Ramsay 



and Silverman (1997, 2002) to have prepared the ground for answers to this question. They, and 
other authors after them, have shown the practical benefits of using ad hoc statistical methods 
for these data sets: the point of view is clearly to take into account the functional nature of 
the data. This means that one considers the data as objects belonging to functional spaces 
with infinite dimension and one has to use adapted probabilistic and functional analysis tools 
to derive properties of estimators in such a context. 

This emulates a need for developing theoretical/practical aspects on the ground of functional 
data analysis. It is the aim of this paper to contribute to this kind of development. The 
framework of our study is itself an important part of functional data problems. We are interested 
in the properties of the linear regression model in this functional framework, that is to say 
performing the regression of a real random variable on a functional variable. The two main 
motivations of this work are to study rigorously the asymptotic distribution of the estimator 
of the regression function and from a statistical point of view to deduce asymptotic confidence 
intervals for prediction based on functional linear regression. 

This kind of model is not new and has many potential applications such as Chemometrics 
as it can be noticed in the paper by Prank and Friedman (1993). Whereas chemometricians 
have mainly used adaptations of statistical multivariate methods, functional procedures have 
gained in popularity more recently as said above. For instance Hastie and Mallows (1993) have 
raised, in the discussion of the paper by Frank and Friedman (1993), the question of functional 
alternative methods. Thus, for this case of estimating a regression, two main approaches have 
been considered, (1) estimating the functional linear regression which is a "continuous" version of 
linear regression when the covariate is a vector of scalars and was first introduced in Ramsay and 
Dalzell (1991) and (2) proposing a complete nonparametric point of view introduced by Ferraty 
and Vieu (2002). We consider the former approach hereafter: see section 2, for the definition of 
the functional linear regression. Contrary to the multivariate linear regression where the vector 
of parameters is identifiable provided that the covariance matrix is non-singular, identifiability 
of the functional coefficient is not ensured unless a sufficient and necessary condition is satisfied 
(see section 2). 

Different estimators for the functional parameter have been considered in the functional 
linear regression model: see for instance Goutis (1998), Cuevas et al. (2002) and Cardot et al. 
(1999, 2003). Upper bounds for the rates of convergence have been found and these results 
show that the transition from the finite dimension to the infinite dimension leads to degenerate 
rates of convergence. As a matter of fact, estimating the functional parameter appearing in 
the functional linear regression can be viewed as an ill-conditioned inverse problem since the 
estimation procedure relies on the inversion of the covariance operator which is compact (Cardot 
et al. 1999, Bosq 2000, He et al. 2003) and it is well known that poor rates of convergence 
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appear for this kind of problems. However the problem of approximating inverses of covariance 
operators or of selfadjoint compact operators is not new. It is adressed in Nashed and Wahba 
(1974), Arsenin and Tikhonov (1977), Groetsch (1993) among many others. The main point is 
always to regularize a matrix M (respectively an operator S) which is invertible but "not by 
much" (respectively unbounded). This property implies that for any vector x, Mx (respectively 
Sx) may have large variations even when x does not vary much. Numerous procedures were 
proposed. Such procedures appear especially in image analysis or deconvolution or in specific 
M-estimation problems for instance. 

The Central Limit Theorem for i.i.d. Hilbert valued random variables play a central role in 
deriving the main results of this paper. The monograph by Araujo and Ginc (1980) or Chapter 
5 in the book by Ledous and Talagrand (1991) deal with this crucial theorem of probability 
theory and provide deep studies about the CLT on infinite dimensional spaces. For the non 
independent case we also mention two recent works from Dedecker and Merlevede (2002) and 
Merlevede (2003). 

In section 3, we consider a class of regularization methods for inverting the covariance op- 
erator that leads to a quite general class of estimators with the aim of investigating GLT for 
prediction and as a by-product producing confidence sets for prediction. Section 4 is devoted to 
the asymptotic behavior of these estimators relaxing as much as possible the set of assumptions 
(moment assumptions, assumptions on the spectrum of F) and considering a large class of reg- 
ularizing methods for inverting the empirical covariance operator. We first derive an important 
result which shows that it is not possible to state a CLT for the functional coefficient with re- 
spect to the norm topology of the functional space. Nevertheless, we show that it is possible to 
get a CLT if we consider the behavior of the predictor with respect to the weak topology, that is 
to say for point-wise prediction. We show that the results depend on the nature of the predictor 
and fixed or random design lead to different CLT. Whereas when the predictor is random it is 
not possible to reach a parametric rate of convergence, this rate can be obtained depending on 
the value and the smoothness properties of the fixed predictor: we obtain a parametric rate for 
pointwise convergence at x wherver x belongs to the reproducing kernel Hilbert space associ- 
ated to the covariance operator. The proofs depend heavily on perturbation theory for linear 
operators to get, as accurate as possible, approximations of the eigenelements of the empirical 
covariance operators. Similar methods based on functional calculus have been used for deriving 
asymptotic properties of the functional principal components analysis by Dauxois et al. (1982), 
Kneip (1995) or Kncip and Utikal (2001). Section 5 proposes a brief discussion about possible 
extensions and statistical applications of these results. Finally section 6 is devoted to the proofs. 
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2 Functional linear regression 



We consider a sample (Xj, Yi),i = 1, . . . ,n of independent and identically distributed random 
variables drawn from a pair {X, Y). The variables X and Y are defined on the same probability 
space and Y (the response) is valued in M. It is usual to define the functional variable X (the 
predictor) as a random variable taking values in a general real separable Hilbcrt space H with 
an inner product denoted in the following by (., .) and an associated norm denoted by ||.||. As 
a matter of fact H may be the Sobolev space W"^'"^ (C) of functions defined on some compact 
interval C of M having m square integrable derivatives, m being a positive integer. In that case 
the inner product (., .) is the usual inner product on this space i.e. 

m „ 

= E / f^\^)9^\^)dx, f,9eH. 

Note that this special case is particularly interesting for modelling situations where we have 
functional data as shown by the numerous applications given in Ramsay and Silverman (1997, 
2002). Although we develop below theory for general Hilbertian random variables, we keep in 
mind this special situation and then use the word functional variable to qualify X. 

In the following we assume that lEY'^ < +oo and that X is a iJ-valued random variable 
such that 

{H.l) -^(ll^f ) < +00. 

Then X is of second order and one can define the expectation of X, namely 1E{X), that we 
suppose in order to simplify the notations to be the null element of H {1E{X) = 0). Moreover 
the covariance operator of X is defined as the linear operator F defined on H as follows 

Fh = IE{X(^X{h)) = lE{{h,X)X), heH. 

It is known that F is a self-adjoint, positive and nuclear operator hence it is Hilbert-Schmidt 
and hence compact (Dauxois et al, 1982). We denote by {Xj)j the sorted sequence of non null 
distinct eigenvalues of F, Ai > A2 > . . . > 0, and {ej)j a sequence of orthonormal associated 
eigenvectors. We assume that the multiplicity of each \j is one (remind that since F is compact 
the multiplicity of each Xj 7^ is finite). We could consider the more general case of multiple 
eigenvalues without affecting our forthcoming results but the price would be more complicated 
proofs and a poor gain with respect to the main objectives of the paper. Let us also define the 
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cross-covariance operator of X and Y as the functional A defined on H by 



A/i = ]E{X®Y{h)) = lE{{h,X)Y), h £ H. 

Now, we aim at considering the functional linear regression of the variable Y on X. This 
means that we are seeking the solution p £ H the following minimization problem 

ini 1E{\Y-{P,X)\'). (1) 

When a solution p exists and is uniquely determined, we can write 

Y = {p,X) + e, (2) 

where e is a centered real random variable with variance such that E{eX) =0. It is quite 
easy to show that it is equivalent that p satisfies equation (j2)) and that it satisfies the following 
moment equation 

A = r^. 

However, when the dimension of H is infinite, existence and uniqueness of p is not ensured since 
a bounded inverse of F does not exist: we need an additional condition to get existence and 
uniqueness of p, namely 



Condition U. The variables X and Y satisfy 



Under condition U, Cardot et al. (2003) show that a unique solution to equation (jSJ exists 
in ((i^er(r))^ and that this solution is of the form 

_ ^ {lE{XY),ej) 

Then, identifiability is true only in {{Ker(T))-^ or in other words the set of solution of © is of 
the form p + Ker{T). Again, to simplify further developments we assume from now on that the 
following condition is satisfied 

{H.2) Ker{r) = {0}. 
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Finally, we assume from now on that the first and second moment of e given X are respectively 
equal to W,{e\X) = and lE{e'^\X) = u^. 



3 Inverse problem and regularization procedure 

Once we get identifiability through condition we turn to the problem of estimating the 
"functional" parameter p from the sample (Xj,!^), z = 1, . . . ,n. The first step is to define the 
empirical versions of V and A which are 



We have 

where Un = J27=i -^i ® taking the expectation we get 

iE(An) = A = Tp. 

As shown in the previous section, inversion of F can be viewed as a kind of ill-conditioned inverse 
problem (unlike in usual ill-conditioned inverse problems the operator F is unknown). Also, the 
inverse of F„ does not exist because F„ is almost surely a finite rank operator. As usually for 
ill-conditioned inverse problem we need regularization and our aim is now to propose a general 
and unified method to get a sequence of continuous estimators for F~^ based on F„. 

The method is theoretically based on the functional calculus for operators (see Dunford and 
Schwartz, 1988, or Gohberg, Goldberg, Kaashoek, 1991, for instance). 

For further purpose we first define the sequence (6j)j of the smallest differences between 
distinct eigenvalues of F as 



Now take for {cn)neN a sequence of strictly positive numbers tending to zero such that c„ < Ai 
and set 




i=l 



i=l 




kn = sup {p: Xp + Sp/2 > Cn} . 



(3) 
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Then define a class of sequences of positive functions {fn)n with support [cn, +00) such that 

(F.l) fn is decreasing on [c„, Ai + 5i], 

(F.2) Mm sup \xfn{x) - 1| = 0, 

(F.3) /ra(^) exists for x G [c„,+cxd). 

Moreover, we will make in some cases the additional condition below which will be helpful to 
reduce the bias of our estimator 

(H.S) sup |s/„(s) - 1| = o{l/V^). 

s>c„ 

Now WG d-Gscribe prcicticSjlly the rGgul8.rizB,tion procGdure. Th.G GigGnvaluGS of P-^ are denoted 
by Xj and the associated eigenvectors by ej. The bounded linear operator Tn is defined the 
following way 

• Choose a threshold c„, 

• Choose a sequence of functions (/«)„ satisfying (F.1)-(F.3), 

• Compute the (functional) PCA of r„ (i.e. calculate the eigenvalues Xj and the eigenvectors 



• Compute the finite rank operator Tn with the same eigenvectors as r„ and associated 



eigenvalues fn{Xj) \^i.e. Ti = YTj=i fn (^Aj j ej ejj . 

Obviously c„ must be larger than the smallest significatively non-null eigenvalue of r„. Once 
the threshold c„ and the function /„ (both depending on the sample size n) have been chosen, 
we see that the computation of the estimator of p is quite easy through the relation 

P = 4A„. (4) 

Now let us give some examples of functions /„, and the derived estimators of p. 

Example 1. If fn{x) = 1/x when x > Cn and elsewhere, condition {H.3) holds and Tn is 
obtained by simple spectral truncation with threshold c„. The operator TnTn is nothing but the 
projection on a finite dimensional space. Note however that the random dimension of this space, 
say dn, is not necessarily equal to kn (see (jSl): for instance we may be in the situation where 
Xk„+i > Cn and then dn > kn + 1. Unlike dn, kn is non random and was introduced because, 
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as will be seen in the proofs, P {dn 7^ kn) tends to zero fast enough to consider essentially the 
situation when dn = kn- In other words the derived estimator for p is asymptotically equivalent 
to the one considered in Cardot et al. (1999). 

Example 2. Let be some scalar parameter. If fn{x) = + when x > c„ and 
elsewhere, we get a ridge-type estimator. Condition (H.3) is satisfied whenever a„y^/c„ — > 0. 

Example 3. Let q„ be some scalar parameter. If /n(x) = x/{x^ + a„) on its support, Tn is 
nothing but the Tikhonov regularization of r„. Once more (H.3) holds if an^fnlc?^ — 0. 

We may define as well, following Mas (1999), a class of approximate for r„ introducing 
fn,p{x) = I (x + anf^^ or fn,p{x) = x^ / {x^~^^ + a„), where again a„ is some scalar parameter 
and p some integer. 

This procedure is quite general to define regularized version or pseudo inverses for r„. Up 
to the authors knowledge, all standard techniques for regularizing ill-conditioned matrices or 
unbounded operators stem from the above functional calculus. 

4 Asymptotic results 

In this section, we mainly announce weak convergence results for the statistical predictor of l^+i 
for a new value Xn+i obtained by means of estimator defined in Q), namely y^+i = (p, -^n+i)- 
Hence, we should study stochastic convergence of 

{p,Xn+l) - {p,Xn+l). (5) 

We also look at prediction for a given value of x & H and study the stochastic convergence of 

{p,x) - {p,x). (6) 

It is important to note that all the results are obtained without assuming any prior knowledge 
for the rate of decay of the eigenvalues Xj of T to zero. We will see that unfortunately a bias 
term appears which cannot be removed without very specific assumptions on the sequence on 
the spectrum of T and on the smoothness properties of p. 

We begin to investigate the weak convergence for the norm topology on H for our estimate. 
The next and important result underlines the limits of the functional approach. It tells us that it 
is not possible to get a general result that would allow to build confidence sets in the functional 
setting. This highlights the fact that when considering functional data one must take care and 
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multivariate classical results are not necessarily true anymore. 

Theorem 4.1 It is impossible for p — pto converge in distribution to a non- degenerate r.e. in 
the norm topology of H . 

The proof of the Theorem is postponed to the end of sectional it is shown actually that for 
any normalizing sequence j +00, a„ {p — p) does not converge in distribution for the norm 
topology but to a degenerate random element. 

Nevertheless this negative result does not mean that it is not possible to get some confidence 
sets. We have to consider a weak topology (with respect to the inner product), that is to say 
point-wise confidence bands, and study separately the cases of deterministic and random points. 
We first give results for the prediction approach. 

We define as fn (r). It is important to note that depends on the sample size n through 
the sequence From this we take in the following 
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and in^x and s„ their empirical counterparts based on the Aj's. Note that the sequence tn^x may 
either converge or diverge depending on whether Yl'j^=\_ ^^J^ ^ ^j)"^ ~ \\^~^^'^-^\\^ finite or not 
(i.e. whether x is in the range of T~^^'^ or not). At the opposite, the term s„ always tends to 
infinity. 



4.1 Weak convergence for the predictor 

We state a weak convergence theorem for the predictor given in We denote by H^^ the 

projector onto the eigenspace associated to the kn first eigenvalues, and by its empirical 

counterpart i.e. the projector on the eigenspace associated to Ai, A2, A^^. 

Assumptions (H.l) — {H.3) are truly basic. They just ensure that the statistical problem is 

correctly posed. In order to get deep asymptotic results we introduce extra assumptions denoted 

(A1)-(A3). 

+00 

(Al) ^\{p,ei)\<+^. 

1=1 



9 



(^.2) There exists a convex positive function A, such that, at least for j large, Xj = A (j). 

We recall the Karhunen-Loeve expansion of X, that is 

+00 

1=1 

where the ^;'s are centered r.r.v such that E^i^i' = 1 if Z = Z' and otherwise. We assume the 
following assumption for variables 

{A.3) supE^f < M < +00. 

I 

Remark 4.1 Assumption {A.2) is clearly unrestrictive since it holds for standard rates of de- 
crease for the eigenvalues, polynomial or exponential. It implies that 

Sk = min (Afc — Afc+i, Ajk_i — A^) = A^ — Xk+i- 

Remark 4.2 Simple calculations show that assumption {A.3) implies assumption (H.l), namely 
that E ll^ll^ < +00 and does not require any condition on the stochastic dependence within the 
(,l's. Besides (A.3) holds for a very large class of real-valued random variables (remind that the 
's are subject to E^; = and E^^f = Ij- 

Theorem 4.2 When assumptions {H.2) — (H.S) and (A.l) — {A.3) hold and if 

kn^^ (logfcn)^ _^ Q 

then 

^((p,X„+i)-(n,„p,X„+i)) A iV(0,a2). 



s. 



n 



Corollary 4.1 // either sup^ ep)|p^/^) < +00 or i/ supp (p^Ap) < +00, the bias term 
(Ilfc^p, in the previous theorem can be replaced with {p,Xn+i). 

Remark 4.3 The term Sn always tends to infinity and hence we cannot obtain a "parametric" 
rate of decay in probability. Besides Sn depends on the unknown eigenvalues. It is worth trying 
to get an "adaptive" version of the above result: replacing the Aj 's with the Aj 's leads to a new 
result with both a random bias and a random normalization term. 
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Corollary 4.2 The adaptive version of the previous Theorem is 



((p,x„+i)-(nfc„p,x„+i))^iv(o,i) 



SnCe 

where 



Remark 4.4 /n all the previous results, the variance of the white noise a1 is unknown. Replac- 
ing (Te with a convergent estimate of does not change the Theorems. 

4.2 Weak convergence for the estimate of p. 

We are now giving weak convergence results for the prediction at a given value x id. H. 



Theorem 4.3 Fix any x in H. When the assumptions of Theorem \4-.^ hold and if 

sup — < +00 and — — > 0, 

P Ap tn^xy/n 



then 

In 



{p,x)-(fLk,,p,x)) ^NiO,l). 



Remark 4.5 The bias term here is random. It can be seen from the proof of the crucial Proposi- 
tion \6.1\ that the situation cannot be improved without very specific (maybe artificial) assumptions 
either on p or on the Aj 's. 



yf n 

The normalizing sequence depends on the unknown Aj's. It is worth trying to get again 

tn,x 



an adaptive version of the above theorem (i.e. replace tn,x with tn,x = y Z]^=i Aj fn ^Aj^ (x, e 
Corollary 4.3 Theorem\4.cl\ still holds if tn,x is replaced with its empirical counterpart tn,x- 



The following Remark is crucial since it brings out once more what seems to be a typical 
feature of the functional setting. 

Remark 4.6 As seen before the sequence tn^x may either converge or diverge. Indeed, i/||r^^/^x|| 
is finite the normalization sequence grows surprisingly at a parametric rate (i.e \fn). This could 
be understood as an extra- smoothing of the estimate p through the integrals involving the scalar 
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product. But in terms of prediction this fact could be misleading. This "extra- smoothing" is in- 
deed an excessive and artificial smoothing since P {\\T~'^/^Xn+i\\ < +00) = 0. This also means 
the realizations of X do not belong with probability one to the reproducing kernel Hilbert space 
associated to its covariance function (Hajek, 1962). In other words the results of this section 
are given for the sake of completeness and to explore the analytical properties of our estimates. 
For these reasons and if prediction is under concern, only {p,Xn+i) should be considered and 
studied. In a multivariate setting all these considerations make no sense, since the situation is 
simpler (in fact, usually P (||r~-'^/^X„_|-i || < +cxd) = 1 because r~^/^ is bounded when T is a full 
rank covariance matrix). 

Within the proofs it is readily seen that assumption (^.1) plays a crucial role in getting a 
non random bias term. The next Proposition illustrates this situation. 

Proposition 4.1 Assume that Afc = that (x,efc)^ = k^^~^ with [5 > 1 + a. Then 

if {p, Cj)"^ = +00, the sequence ^ / ("llfc^ — Ilfc^") jO, x\ may not be bounded in 

probability even if the random variables Xi, i = 1, . . . ,n, are i.i.d. centered, Gaussian. 

Remark 4.7 The condition (3 > 1 + a just ensures that ||r^^/^x|| < +00. Besides if {p,ej)'^ = 
3~^~^ : Z]j=ii^~^ {p^^j)'^ diverges whenever /3 + 7 < 1 which implies that Yl^=i \{P^^j)\ — +00- 

In fact the assumption on the location of p mentioned in the Proposition should be under- 
stood as smoothness conditions. 



5 Concluding remarks 

One important application of previous results is the construction of confidence sets for prediction. 
In real life problems, the regression function p is unknown but Corollary 14. II allows us to build 
confidence sets. Let qa be the quantile of order 1 — a/2 of a Gaussian random variable with 
mean and unit variance, we get under previous assumptions the following confidence set for 
prediction, 

lim P |< p, Xn+i > - <p, Xn+i >\>qA = l-a . (7) 

n— »oo \a Sn J 

A simulation study (Cardot et al., 2004) has shown that such confidence sets are accurate even 
for moderate sample sizes, i.e. for n around 100. 

From a mathematical points of view, one of the main novelty of this work relies on the facts 
that no prior information on the eigenvalues is assumed and the dimension sequence kn does not 



12 



depend on the rate of decrease of these eigenvalues. As a consequence kn increase rather slowly, 
but not that much for a non parametric model. Nevetheless, let us notice that this situation 
may be significantly improved if some information on the eigenvalues is available. 

From Theorem 14. 2 1 it is possible to derive a general bound for the prediction error. Simple 
calculations (see the proof of Corollarv 14. 1|) lead to : 

(p-/>,X„+i)2 = Op(^)+Op( f; Xj{p,e,)A. (8) 

^ \j = kn + l J 

Thus, it is not possible to go further without imposing more precise hypotheses on the smooth- 
ness of function p with respect to the basis of eigenfunctions ej and the rate of decay of the 
eigenvalues Xj as remarked sooner in the article. Nevertheless, it was seen that the second term 
on the right in (jH)) can converge rapidly to zero in some situations. Besides assumption (^.1) 
provides us with some kind of uniformity with respect to p when the latter belongs to a subset 
of H. Naturally, with these remarks we have in mind the study of the minimax rate of risk 
for the class of our predictor. 



6 Proofs 

Along the proofs we suppose that (H.l) — {H.3) hold. The letter C will always stand for any 
(nonrandom and universal) positive constant. For any bounded operator T defined and with 
values in H we classically set 

ll^lloo = sup ||Tx|| , 

where Bi is the unit ball of H. We will quite often make use of the following facts. 



For any u in H, 



For a sufficiently large i, Xi < 



E{X,u)^ = {Tu,u) 
C 



ri/2, 



i log i 

The Hilbert-Schmidt norm is more precise than the classical norm for operators. Hence if 
T is Hilbert-Schmidt 



l^lloo < WTWhS 



2 

■Pll ! 



where {up)^^^j is any complete orthonormal sequence in H. 
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Prom definitions of p and Un we have 

from which the forthcoming decomposition is trivial 

p-Jlf^^p = Tn + Sn + Rn + yn. (9) 

where 



We also denote 



Ln = nfe„p - p. 



The proofs are tiled into four subsections. After a brief introduction on operator-valued analytic 
functions, we begin with providing useful convexity inequalities for the eigenvalues and subse- 
quent moment bounds. The second part shows that all the bias terms but say r„, Sn and 
Yn tend to zero in probability when correctly normalized. Weak convergence of Rn is proved in 
the short third subsection. The last part provides the main results of the paper by collecting 
the Lemmas and Propositions previously proved. 

6.1 Preliminary results 

All along the proofs we will need auxiliary results from perturbation theory for bounded opera- 
tors. It is of much help to have basic notions about spectral representation of bounded operators 
and perturbation theory. We refer to Dunford-Schwartz (1988, Chapter VII. 3) or to Gohberg, 
Goldberg and Kaashoek (1991) for an introduction to functional calculus for operators related 
with Ricsz integrals. 

Let us denote by Bi the oriented circle of the complex plane with center Aj and radius Si/ 2 
and define 

Cn = \jBi. 

1=1 

The open domain whose boundary is C„ is not connected but however we can apply the functional 
calculus for bounded operators (see Dunford-Schwartz Section VII.3 Definitions 8 and 9). We 
also need to change slightly the definition of the sequence of functions (/n)„ by extending it 
to the complex plane, more precisely to C„. We admit that it is possible to extend to an 
analytic function defined on the interior of C„ (in the plane) such that sup^-g^^ \ fn (^) < 



14 



Csup3.g[c„,Ai+5i] \fn{x)\. For instance if /„ (x) = {l/x)l[c„,+oo){x), take fn{z) = {l/z)lc^{z). 
Results from perturbation theory yield 

nfc„ = :^ / iz-r)-Uz, (10) 

2^^- Jc„ 

r^ = ^ / (z-r)-Vn(^)d^, (11) 

where = —1. 

We introduce also the square root of symmetric operators: if T is a positive self-adjoint 

1 /2 

operator (random or not), we denote by {zl — T) ' the symmetric operator whose eigenvectors 
are the same as T and whose eigenvalues are the complex square root of z — Afe, G N, denoted 
{z - Xkf' . 

Lemma 6.1 Consider two positive integers j and k large enough and such that k > j. Then 

jXj > kXk and Xj — Xk > ~ -^i- 

Besides 

J^Aj < (A: + l)Afc. (13) 

Proof. We set for notational convenience Xj = ip{l/j) where ip is, by assumption {A.2), a 
convex function defined on the interval [0, 1] such that that (/? (0) = and (p{l) = Ai. 
The two inequalities in H12|) follows directly from the well known inequalities for convex functions 

(fjxi) - (f{xo) (p{x2) - fjxo) (p{x2) - ipjxi) / / ^1 

< < , < Xo < 3^1 < X2 < 1, 

Xi - Xq X2 - Xq X2 — Xi 

and by taking xq = 0, xi = 1/k and X2 = 1/j. 

Set fik = J2i>k^i- ^^^y ^^^^ sequence {nk)k satisfies assumption (^.2). Indeed 

for all k 

which is a sufficient condition to construct a convex function fi (k) = ii^- We can then apply the 
second part of (fT2|l with /ifc+i instead of A^ and /ife instead of Xj, which yields 

fJ'k — fJ'k+i = Afc > -r—rfJ'k, 
k + 1 

and (|13j) is proved. ■ 
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Lemma 6.2 The following is true for j large enough 



E 



< Cjlogj. 



Proof. We are first going to decompose tlie sum into tliree terms 

A, 



E 



T1 + T2 + TS., 



where 



^-Ei^.^-E 1:^.^3^ E 



1=1 



— Ao — A/ 
i=j+i ^ 



A, — A; 

l=2j+l J ' 



Applying Lemma l6. II we get 



A; — \j ^ 7 — I 

=1 ' J 1=1 



where Ci is some positive constant. Also, applying once more (|T^ then ((T^ . we get 



2i 



A 



2i 



A; / 
Aj I - j 



A'j — A/ 

i=j+i ■' i=i+i ■' 

<2j V ^ : <C2jlogj, 

l=j+l 



and 



+00 



'^3< E 



< 



Xj — Xi Xj — X2j 



Hence the result follows and Lemma 16.21 is proved. 



Lemma 6.3 We have for j large enough 



E sup 



(zI-T)-'/^ (r„-r) (zI-T) 



-1/2 



2 C 2 

< — (jlogj) 

00 n 



(14) 



and 



E sup 



< Cjlogj. 



Proof. Take z & Bj. By bounding the sup norm by the Hilbert-Schmidt one (see above), we 
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get 



+00 +00 

^ E E ((^^ - ^y'^' (r„ - r) {zi - ry'/' (e,) , e. 



-1/2 



i=l k=l 

< 2 ((r„-r)(ei),efc)^ 



|z - Ail I2: - Afcl 



+00 



< 



, v;^ ((r«-r) (e;),efc) v;^ 



{{Tn-r){ej),ekf , ((r„-r)(e,-),e,) 



ifcj'i |Aj - Ail |Aj - Afcl 



k=l, 



6j \z - Afcl 



+ 



^1 



since it can be checked that whenever z = Xj + "^e'^ G and i j 



A, - A. + |e^^ 



\z - Ail = 

Besides 

E((r„-r) {ei),ek) 
when assumption (A. 3) holds. Finally 



>|Ai-Ai|-^>|A,--Ai|/2. 



n 



E({Xi,ekf{X,,eif) -{r{ei),ekf 



< AlXk, 

n 



E sup 



M 
< — 

n 



M 

n 



{zi - r)-i/2 _ _ p)-i/2 

XiXk 



+00 

E 



+00 



; fr;^ I Aj - Ai I I Xj - Xk I 5j ^J^^j I Aj - Ai I V -^i 



+00 

E 

k=l,kj^k 



Xk 



X 



+00 



|Aj — Ajfcl 



E 



Xk j_ f 



k=l,k=/=j 



|Aj-A/| \Sj 



It suffices now to apply Lemmas Ifi.ll and IH.2I to get the desired result. The same method leads 
to proving the second part of the display. Lemma is proved. ■ 



Lemma 6.4 Denoting 

£, (z) = [\\{zi - r)-i/2 (r„ - r) {zi - ry'/^ 



< 1/2, z€Bj], 
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The following holds 

(z/-r)i/2 (^/_r„)-i(z/-r)i/2 

where C is some positive constant. Besides 



< C, a.s. 



n 



(15) 



Proof. We have successively 

{zi - r„)-^ = {zi - ry' + {zi - ry^ (r - r„) {zi - r„)-^ 



hence 



and 



i + izi- ry^/^ (r„ - r) {zi - ry^/^] {zi - rf^^ {zi - r„)-^ {zi - vf^ = i. (17) 



It is a weh known fact that if the hnear operator T satisfies < 1 then J + T is an invertible 

and its inverse is bounded and given by formula 



(/ + T)"^ = / - r + 

From (fTK|) and (fT7|) we deduce that 

{zl-Vf^ {zI-Tny\zI-Tfl^ 



i + {zi-Ty^'^ (r„-r) {zi-Ty 



-1/2 



'^Sj{z) < 2, a.s. 



Now, the bound in (|15|) stems easily from Markov inequality and (|14)1 in Lemma Id.SI This 
finishes the proof of the Lemma. ■ 

The empirical counterparts of (|lfl|) and (|11|) -mentioned above- involve a random contour, 
say Bi, centered at Aj. It should be noted that these contours cannot be replaced by the i3j's 
since the latter may contain more than eigenvalues of r„. The aim of the following Lemma 
is to find sufficient conditions under which Bi may be replaced with Bi. In other words, we have 
to check that for a sufficiently large n the p*^ eigenvalue of r„ is close enough from the p^^ 
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eigenvalue of T. Before stating this first lemma, we introduce the following event 



VjG{l,...,fc„} 



< 1/2 



Lemma 6.5 // 



kl log kn 



0, th,e.n 



p 

where r„ is a random operator satisfying y/nrn —>■ in the operator norm. 

Proof. When the event An holds, the kn first empirical eigenvalues Xj lie in Bj and then 

=J— f (z- r„)"^ dz = -^ f (z- Tn)~^ dz. 

From this it is clear that 

/ (z — Fn)'^ dz = Uu + 1 /ic / (z — rn)~^dz. 

Denoting r„ = ^a^t^ — Jc ~ ^n)~^ dz, we see that, since - — {z — r„)~^ dz 



have for £ > 



It remains to find a bound for P {An)- We have 



1, we 



■(^;;)<^p(|a,-a,|>V2) 



i=i 



2 ^A,V^^ 



A. 



(18) 



In order to get a uniform bound with respect to j of the latter expectation we follow the same 
arguments as Bosq (2000), proof of Theorem 4.10 p. 122-123. In Bosq, the setting is quite more 
general but however his Theorem 4.10 ensures that in our framework the asymptotic behaviour 
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of 



Aj - Aj 



is the same as yjn- 



((r„ — r) e,-, e,-) 

— - — - — . From assumption (A. 3), we get 



^ E|((r„-r)e,-,e,-)| 
yn 



A. 



< 



E 



a; 



(19) 



where C does not depend on j. From H18|) and H19() we deduce, applying Lemma l6. II once more, 
that 



C 



C 



C 



from which the result follows. ■ 

It may be easily proved that the same result as in the preceding Lemma holds with Tn instead 
of Ilfc^ . From now on we will implicitly work on the space An and then write 



n 



h 



and 



^1 = (il<^-''»>"^^' <-■>"-■)■ 

We will also abusively denote nfc^l_4^ by H^^ and r1'l_4^ by F^^. 

Remark 6.1 In fact thanks to Lemma 1 6'. 51 we can deal with all our random elements as if 
almost surely all the random eigenvalues were in their associated circles Bj. The reader should 
keep this fact in mind all along the forthcoming proofs. The condition on k^ needed on the 
Lemma is clearly weaker that the ones which appear for the main results to hold. 



6.2 Bias terms 

As announced above this subsection is devoted to the bias terms 5„ , r„ and Yn . A bound is 
also given for L„ for further purpose. We first begin with the term r„ for which we have the 
following lemma. 

Lemma 6.6 // {H.3) holds 
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Proof. Obviously rj^r„ — is a self-adjoint random operator whose eigenvalues are the 



1 1 and otherwise. So we have 

i<i<fc„ 



rt r„ - Uk„p 



<Csup(|s/„(s)-l|). 

°° S>Cn 



If assumption {H.3) holds, the last term above is an o{l/-\/n), which proves the second 
equality. ■ 

Lemma 6.7 The two following bounds are valid 



n „ , , Xk In 



kn V log kn ' 



l>kn+l 



Proof. We have 



E|((/-nfcJp,X„+i)|< /E E {p,eif{Xn+i,eiy 

V l=k„+l 



Y l>k„+l 
^ < h 



. VK log A;^ 

since A; and e;)| are absolutely summing sequences. ■ 

Proposition 6.1 // -^kn'^ (log A;„)^ —>■ as n goes to infinity, then 
■s/n 



21 



Proof. The proof of the Proposition is the keystone of the paper. We begin with 



Iffc -lit 



L [(-^-r,)-'(r,.-r)(./-r)- 



1 



dz 



where 



and 



1 



= ^ E / [('^ - (r„ - r) {zi - T)-' 
j=i 



dz. 



T^n = ^Y. I [(^^ - r)"' (rn - r) {zi - T)-^ (r„ - r) {zi - r„)~^ 



dz. (20) 



Result ()26() below will provide us with a sufficient condition for TZn to be negligible. At first, we 
turn to Sn- We have 



+ 00 



(+00 \ / +00 

since E {{Xn+i, ei) {Xn+i, e/')) = if / / /' and Xn+i is independent from 5„. Now 

/+00 \ 2 

E {SnP, eif = E {p, SnCif = eI^{p, ev) {SnCi, ei>) \ . 

The operator 5„ was explicitly computed by Dauxois et al. (1982). More precisely 
1 



27rt 



{Zl - T)-^ {Tn - T) {Zl - T)~^ dz = Vj {Tn - r) TTj + TTj (T^ - T) V 



•31 
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1 



with Vj = ^jt^j T ^^i' where tTj is the projector on the eigenspace associated to the 



Xji - Xj ^ 
eigenf unction of F. Hence 



{Sneueii) = ^ [((r„ - r) -KjeuVjeii) + ((r„ - T) VjewKjev)] 

if (/' < kn and / < kn) or if (/' > /c„ and I > kn), 



A/' — A; 

((rn - r) ei,ei> 
A; - Xi' 

FinaUy, if we take for instance I < kn 



if I' < kn and I > kn- 



\l'>kn + l ' / 

n +00 

j=l l'>kn + l 



Xi' — Xi 



where 



and the 



+00 



l'>kn+l 



{{Xj(^X,~T)ei,ei>) 



A;' - A; 

are centered and uncorrected random variables. Hence 



\ j=i / \V>k„+l 



((Xi,e;))((Xi,e,) ) 

Ar — A; 



Since / < A;„ < Z', by using the Karhunen-Loeve expansion of Xi, we get 



+00 



x73T = \^'^'') 



/'>fen+l 



A;' - A; 



'>k„+l 



A;' - A; 
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and then 



+00 



((Xi,eO)((Xi,er)) 



+00 



2^ {p, ev ) {p, em) -j7 r^TT ry • 



A;' — A; 

By applying twice Cauchy-Schwarz inequality to the ^fc's and under assumption {A.3), we get 



Summing up what we made above we get 



+00 



J'>kn + 1 



A/' — A; 



Az' — Xi 



Remember that we had fixed I <kn- Now, if we take / > kn similar calculations lead to 



At last 



(^71 
Z'=l 



((Xi,eO)((Xi,ep)) 
Az' — Az 



Z'=l 



VAzA;/ 

Az' — Az 



^E(5„/),X„+i)^ < Az I ^ (/0,ez'' ^^^^^ 



1 = 1 \V>kn + l 



Az' — Az 



We apply Lemma IHTD first to bound 



«=1 \/'>fc„+l 



kn 



1=1 



l'>kn+l 



" z=i y'>fc„+i ^~ Ji ) 



(22) 
(23) 
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Now we set /i„ 



log kr, 



where [n] , It G M, denotes the largest integer smaller than u. Note 



that the last inequality in the display above may be split as follows 



+CX) 



2M 















1=1 


\l'>kn+l 


1 




' +00 












il'>kn+hn 





T'J 



1 - 



(24) 



Dealing with the second term we get for V >kn + hr, 



1 _ _ > 1 2 — = H: — ^ 



and hence 



+00 _ +00 ^ , 



+00 



< E I (P' (1 + V^n log A;„^ . 



Now obviously 



sup v^A^ 

'>kn+h n 



since \/\id' \ogl' from which we deduce that 



2M^ — r- I 



=1 \ l'>k„+hn 



1=1 \l'>k„+h„ 
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When assumption {A.3) holds, Cesaro's mean Theorem ensures that the term on the left above 
tends to zero. We turn to the first term in equation H24|) 



+h 

E 

y'>k„+i 



7i + /ln ^ 

\{p,ei')\V>H' 



\{p,ei')\y/Xi 



< max 

fc.n + l<«'<fc„+/l„, 

l<l<k„ 



2M 



Now Xk„kn as well as A;^ max/j^+i<;/<fc^_|_/i^^ {\{p, ei')\) tend to zero when assumption (^4.3) holds. 
We get once more 

/ \ 2 

E \{P,ei')\V>^ ^ -0. 

1=1 Y>k„+i ^~ ji J 

A similar truncating technique would prove that the term in ^l'6\i also tends to zero as n goes 
to infinity which leads to 

(25) 



n o 



In order to finish the proof of the Proposition we must deal with the term introduced in H2U() . 
We have the following result 



(26) 



kl log kn 
when - 



n 



0. Indeed, consider 



(zi - r)-i (r„ - r) {zi - ry^ (r„ - r) {zi - r„)-^ 



dz. 



Then setting 



Gn (z) = {Zl - r)-l/2 _ _ p) 



-1/2 



26 



we have 



|(rj-„p,x„+i)| (27) 
/ ((z/ - r)-^/2 _ _ p)-i (p^ _ p) _ p^)-i _ p)-i/2 ^^^\^ 

< I \(gI{z){zI -Tf'\zl -Tn)-\zl -Tf'\zl -T)-^'^ p,{zl -T)-^'^ X„ 

<[ \\Gniz)\\l [zI-rf/'izI-Tr^r'izI-r)'' 



{zI-T)-'^'Xn+i {zl-Vr'/'p 



Following Lemma 16.41 the random variable {zl — F)^''^ {zl — F„) ^ {zl — F)^^^ 
posed in two terms 



is decom- 



{zI-Tf/^ {zI-Tn)-\zI-Tf/^ 
On the one hand when £j [z) holds it was proved in Lemma 16.41 that 



||(z/-F)|L izI-T^r' 



< C. 



On the other hand when if? (z) holds we may write for all r/ > thanks to bound ()15() 



M 



which entails that 



(28) 



Consequently W6 Ccin dccil with all Tj_^.fi as if the event £j {z) -hence the bound p8|l- holds almost 
surely. We take expectation and note that G„ {z) and X„+i are independent 



E|(T,-„/>,X„+i)| <C f E||G„( 

JBi 



{zl -T)-^'^ Xn+i {zI-T) 



-1/2 



(iz. 
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By Lemma 16.31 we have 



C 



E|(r,-„p,X„+i)| < -diam(^,) • (jlogjf sup {zl - Vy'l' p 



n 



-1/2 



<CV'^,-(ilogjf/'||p|| <C(jlogj 



since < C (jlogj) at least for a sufficiently large j. Finally summing over all the j's from 
1 to kn leads to 



j -| /~i 

^^E|(7^„p,X„+l)| < J^(jlogj)2 < (log 



which proves (|26() and achieves the proof of the proposition. ■ 

The methods used to prove the next Proposition are close to those developed above. 

1 5 /2 2 

Proposition 6.2 If —^kn (logA;„) — > 0, then 
\/n 



yf|((r*-rt)t;„,x„« 



Besides if x is a fixed vector in H such that supp — ''—^ — < +oo and 



kl (log fen)' 



0, 



0. 



Proof. Once again we develop the expression above by means of complex integrals for 
operator-valued analytic functions. Hence 



4 - rt = ^ /■ /„ (z) [{zi - r^r' (r - {zi - ry' 



dz 



E ^ / (^) [('^ - Tn)"' (r - r„) (z/ - rr' 
j=i -^e. 



and 



Ti-rA [/„„x 
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where 



H 



J-.n 



fn (z) {{zi - rf^ {zi - r„,)-i {zi - vf'^ Gn {z) {zi - vy'/^ Un, {zi - T)-'/^ X^+l) 



dz. 



We copy verbatim the arguments used to bound H27() : first of all we reintroduce the operator 
Gn (z) below and 

{zi-rf^ {zi-Tn)-'{zi-rf\ 

remains almost surely bounded by a constant which does not depend on n or j plus a negligible 
term as was proved just below (|28j) . Hence 



dz. 





fn{z) 


\\Gn {Z)\\ 









We take expectation 

E//j> <c |7„ (z)| E (\\Gn {z)\\ \{zi - r)-i/2 ^^ip ^ 11^^^ _ r)-i/2^^ 



dz 



<Cdiam(i3j)sup fn{z) ^ {zl -Vy^'^ X^+i a/e ||G„ (^)f a/e [zl-Vy^'^Un 



where Cauchy-Schwarz inequality was applied. Now invoking Lemma l6 . 31 vields 
,diam (Bj) 



EHj^n < C- 



(jlogj)^/^ sup 
n zeB, 



fn{z] 



E 



{zi-vy^/^u, 



Obviously 



hence 



At last 



E 



(zi-Ty^''^Un = ^E||(z/-r)-^/^Xi 

n II 

\ - A; 



2 al 



-1/2 



^ k- A/I' 



sup I ^/E 

zeB. 



{zi-vy^/^Un I <^(jiogjy/2 



EF,.„ < C-^ (jlogjf < - (ilogjT 
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and 



E 



4 - rt Un, Xn+1 ) < -kl (log K) 



c , 



n 



which proves the first part of the Proposition. Replacing with a fixed x in if, means 



replacing E 



(z/-r)-i/2^„+i 



{zI-T)-^''^x 



with 



< 



E 



+00 , 



1 1^ - -^pl ' 



p=i ' - V ^ \ p= 

and the derivation of the second part of the Proposition stems from the first part. 



6.3 Weakly convergent terms 

This subsection is quite short but was separated from the others for the sake of clarity and in 
order to give a logical structure to the proofs. 

Lemma 6.8 We have 



and 



Proof. We have 



J^{R^,x)^N{0,al),xeH, 

V 

1 " 

{Rn, X) = (rtf/„, x) = -Y, (r^^i, x) Si, 



1=1 



which is an array - implicitly depends on n - of independent real r.v. The Central Limit 
Theorem holds for this sequence and leads to the first announced result. We turn to the second 
display 



i=l 



Denoting j^j the cr-algebra generated by {Xi,£i, ...,Xj,£j), we see that^j ^ 

is a martingale dif- 
ference sequence w.r.t. J^i. Also note that 
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and that 



E 



E 



i=i 

Applying the Central Limit Theorem for real valued martingale difference arrays (see e.g Mc 
Leish, 1974) we get the second result. ■ 

6.4 Proofs of the main results 

The careful reader has noted that within the preceding steps of the proofs was replaced with 
kn in the normalizing sequence. Very simple computations prove that under (H.2) and if knj \fn 
tends to zero this permutation is possible (it is enough to prove that kn > Cs'^ for some constant 
C). 

Proof of Theorem 14.21 

The proof of Theorem 14.21 stems from the decomposition Q, Lemma l6.6| Proposition 16.11 
Proposition 16.21 and from Lemma 16.81 
Proof of Corollary 14. IL 

The proof of the Corollary is a straightforward consequence of Lemma 16.71 when choosing 



n 



Proof of Corollary 14.21 

\sl - 

It suffices to prove that " „ 



0, or equivalently that 



^'i=l ^jfn (Aj) - Xjfn ( Aj ) ( Xjfn (Aj) + Aj/„ ( Xj 



Efcl [A,/n(A,)]2 



0. 



Clearly since sup^^j^ 
get 



Op {l/^/n) and xfn{x) is bounded for x > c„ it is enough to 

^ 0. (29) 



Yl'i=l ^jfn (Xj) - Xjfn ( Xj 



Ylj=l i^jfn {Xj 



31 



But by assumption {H.3) 



J2 |A,/n (A,) - X,fn (a,) I < |A,/n (A,) - 1| + ^ |l - A,/„ (a, 
j=l j=l j=l 

= op {kn/Vn) , 



and kni \/n 0. 



Proof of Theorem 14.31 

Like Theorem l4.21 the proof of Theorem l4.3l stems from Q, Proposition 16 . 21 and from Lemma 

1 

Proof of Corollary 14.31 

We have to prove that 



fn \ 



{x,ejf - Xj [fn{Xj)f {x,ej)' 



i2 



E'=iA,[/„(A,-)]^(x,e,) 



We spht the expression into two terms 



E =1 A 



Wn2 



fn X:i 



Aj [fn{Xj)f ) {x,ejf 



E?=iA,[/„(A,)]'(x,e,f 



Ej=i>^AfniXj)r[{x,e,y-{x,e 



Copying what was done for the proof of Corollary 14. 2( we can easily prove that Wni 0. In 

p 

order to alleviate formulas and displays, we are going to prove that Wn2 ^ in the special case 
when fn{Xj) = l/Aj. The general situation stems easily from this special case. Thus, we have 
now 

E)=i {{x,e,f-{x,e,f) /A, 

W^n2 = — 



Ej=i(x,e,)VA, 



We denote by nj the projector on the eigenspace associated to the j eigenfunction of r„. Then, 
i nota 

we have 



with this notation, we can write (x,ej)^ — (a;,ej)^ = ||vrjx||^ — ||vrpx||^ = ((vfj — 7rj)x;x) and 



{x, Cj) {x, Cj) 



< vr,- — ttJI \\x\ 

— II J J 1 1 OO 1 1 I 
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1 

1 

27ri 



[zI-Tn)-' -{zI-T] 



-1 



dz 



and 



Finally 



(z/-rO"'(r„-r) {zi-ry' 

WW- II /^.^'logj' 

log kn Q 



I I ^ ^ 1 • 1 • / ^ log 



which finishes the proof of the Corollary. 
Proof of Proposition 14.11 

Take x = p = Yl Pi^i 1^ ^- Obviously, it suffices to prove that the Proposition 

holds when — Ilfc^ is replaced with 

v.fc„ ((r„ -T)) = Y, [Sj (r„ - r) n, + n, (r„ - r) s^] . 

Following Dauxois et al. (1982), p. 143-144, we can check that when Xi is Gaussian, ^/n (r„ — P) 
converges weakly to the Gaussian random operator G defined by 

j<j' 3 

where £,j^j'^s are i.i.d. Gaussian centered r.r.v. with variance equal to 1. Thus, we replace 
once more -y/n (r„ — F) with G (the situation is indeed the same as if the operator Xi ® Xi 
was assumed to be Gaussian). We are going to prove that ('^)^'^) is not bounded in 

probability, whatever the sequence kn — > +00, by choosing a special p. We focus on the jth term 
of the above sum. 

The exact computation of < (JljGSj + SjGIij) {x) , p > may be deduced from Dauxois et al. 
(1982) p. 146. Assuming that all the Aj's have all the same order of multiplicity equals to 1, we 
easily get 

{XjPl + XiPj)ijl. 



< {lijGSj + SjGIlj) (x) , p >= ^ 



Al — A; 
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The previous sum is a real centered Gaussian random variable with variance 

—^{XjPl+XlPj) . 

Summing over j provides the variance of {ipk„ (G) p, x) 

E E (-m + -mf > E E 7r\7 ^ E E 73^- 

j=i i^j (^i - j=i i^j (^i - j=i 1=1 i^j - ^0 

For the sake of simphcity we assume that Xk > and pk > 0. Now if xf = and A/ = 

the computation of the second sum stems from 



tl X^xf P-^ s^-f^ 



y—^^- — — — 2ds-cj 



, s 

1 



_ ^ 2+a-P 



Finally 

EvIE77^2c^Ej-'-*^>I-+- 

j=l 1=1 V^J - '^'J j=i 
We see that the variance of ^ — ^— — ^ explodes and that this random variable cannot con- 

tn,x 

verge in distribution. 

Proof of Theorem 14.11 

From and all that was made above it suffices to prove that the Theorem holds with C/„ 
replacing p — Ilk„P- Now suppose that for a given normalizing sequence a„ > 0, a„[/„ converges 
weakly in the norm topology of H (the deterministic sequence a„ just depends on the random 
variables Xfc,^^). For all x in H , On {Un, x) converges weakly too and 

n 

Or. 



n 

i=l 



is an array of real independent random variable. Suppose that x belongs to the domain of F ^, 
namely that 

+00 , >2 

Eix, ej) 

J=l J 
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then 

1 

-j=Y,{x,,T^x) e, ^ N {Q,(3,al) , 

where Px depends on x and on the eigenvalues of F. Consequently = -y/n. Now if Ylij {x, Cj)^ /Xj 
is divergent, E ^(Xj, F'l'x)^ t +00 an {Un,x) cannot converge in distribution. This fin- 
ishes the proof of the Theorem. 
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